Using the R Package crlmm for Genotyping and Copy Number Estimation.
Authors
Abstract
Genotyping platforms such as Affymetrix can be used to assess genotype-phenotype as well as copy number-phenotype associations at millions of markers. While genotyping algorithms are largely concordant when assessed on HapMap samples, tools to assess copy number changes are more variable and often discordant. One explanation for the discordance is that copy number estimates are susceptible to systematic differences between groups of samples that were processed at different times or by different labs. Analysis algorithms that do not adjust for batch effects are prone to spurious measures of association. The R package crlmm implements a multilevel model that adjusts for batch effects and provides allele-specific estimates of copy number. This paper illustrates a workflow for the estimation of allele-specific copy number and integration of the marker-level estimates with complementary Bioconductor software for inferring regions of copy number gain or loss. All analyses are performed in the statistical environment R.
Similar resources
R/Bioconductor software for Illumina's Infinium whole-genome genotyping BeadChips
Illumina produces a number of microarray-based technologies for human genotyping. An Infinium BeadChip is a two-color platform that types between 10^5 and 10^6 single nucleotide polymorphisms (SNPs) per sample. Despite being widely used, there is a shortage of open source software to process the raw intensities from this platform into genotype calls. To this end, we have developed ...
Genotyping with the crlmm Package
The crlmm package contains a new implementation for the CRLMM algorithm (Carvalho et al. 2007). Our focus is on efficient genotyping of SNP 5.0 and 6.0 Affymetrix arrays, although extensions of the method are under development for similar platforms. This implementation, compared to the previous one (in oligo), offers improved confidence scores, quality scores for SNPs and batches, higher accu...
VanillaICE: Hidden Markov Models for the Assessment of Chromosomal Alterations using High-throughput SNP Arrays
The starting point for this section of the vignette is B allele frequencies and log R ratios that are available from software such as GenomeStudio and the R package crlmm. In this section, we assume that the low-level summaries are available in a plain text file, one file per sample. For users of the crlmm package for preprocessing, please refer to the crlmmDownstream vignette. To illustrate ...
A multilevel model to address batch effects in copy number estimation using SNP arrays.
Submicroscopic changes in chromosomal DNA copy number dosage are common and have been implicated in many heritable diseases and cancers. Recent high-throughput technologies have a resolution that permits the detection of segmental changes in DNA copy number that span thousands of base pairs in the genome. Genomewide association studies (GWAS) may simultaneously screen for copy number phenotype ...
Detection of de novo copy number alterations in case-parent trios using the R package MinimumDistance
For the analysis of case-parent trio genotyping arrays, copy number variants (CNV) appearing in the offspring that differ from the parental copy numbers are often of interest (de novo CNV). This package defines a statistic, referred to as the minimum distance, for identifying de novo copy number alterations in the offspring. We smooth the minimum distance using the circular binary segmentation ...
Journal:
- Journal of Statistical Software
Volume 40, Issue 12
Pages: -
Publication date: 2011